Dataset statistics
| | Before imputation | After imputation |
|---|---|---|
| Number of variables | 16 | 14 |
| Number of observations | 8693 | 8693 |
| Missing cells | 2323 | 0 |
| Missing cells (%) | 1.7% | 0.0% |
| Duplicate rows | 0 | 504 |
| Duplicate rows (%) | 0.0% | 5.8% |
| Total size in memory | 1.1 MiB | 832.1 KiB |
| Average record size in memory | 128.0 B | 98.0 B |
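The percentages in this table can be re-derived from the raw counts. The sketch below is only a consistency check on the reported figures, not the report's own code; the per-variable list is copied from the "Missing" alerts of the "Before imputation" dataset, and the helper name `percent` is hypothetical.

```python
# Counts copied from the dataset statistics table ("Before" and "After" columns).
n_rows = 8693
n_cols_before = 16
missing_cells_before = 2323
duplicate_rows_after = 504

# Per-variable missing counts, copied from the "Missing" alerts (before imputation).
missing_per_variable = [201, 217, 182, 179, 203, 181, 183, 208, 183, 188, 199, 199]


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


# Missing cells are counted over all cells (rows x variables).
missing_pct_before = percent(missing_cells_before, n_rows * n_cols_before)
# Duplicate rows are counted over rows only.
duplicate_pct_after = percent(duplicate_rows_after, n_rows)
# The per-variable alerts should add up to the dataset-level total.
missing_total = sum(missing_per_variable)

print(missing_pct_before, duplicate_pct_after, missing_total)
```

The twelve per-variable counts add up to the 2323 missing cells reported for the dataset.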
Variable types
| | Before imputation | After imputation |
|---|---|---|
| Categorical | 5 | 4 |
| Boolean | 2 | 2 |
| Numeric | 9 | 8 |
Alerts
| Before imputation | After imputation | Alert type |
|---|---|---|
| VIP is highly imbalanced (84.0%) | VIP is highly imbalanced (84.3%) | Imbalance |
| HomePlanet has 201 (2.3%) missing values | Alert not present in this dataset | Missing |
| CryoSleep has 217 (2.5%) missing values | Alert not present in this dataset | Missing |
| Destination has 182 (2.1%) missing values | Alert not present in this dataset | Missing |
| Age has 179 (2.1%) missing values | Alert not present in this dataset | Missing |
| VIP has 203 (2.3%) missing values | Alert not present in this dataset | Missing |
| RoomService has 181 (2.1%) missing values | Alert not present in this dataset | Missing |
| FoodCourt has 183 (2.1%) missing values | Alert not present in this dataset | Missing |
| ShoppingMall has 208 (2.4%) missing values | Alert not present in this dataset | Missing |
| Spa has 183 (2.1%) missing values | Alert not present in this dataset | Missing |
| VRDeck has 188 (2.2%) missing values | Alert not present in this dataset | Missing |
| Cabin_deck has 199 (2.3%) missing values | Alert not present in this dataset | Missing |
| Cabin_side has 199 (2.3%) missing values | Alert not present in this dataset | Missing |
| Age has 178 (2.0%) zeros | Age has 178 (2.0%) zeros | Zeros |
| RoomService has 5577 (64.2%) zeros | RoomService has 5651 (65.0%) zeros | Zeros |
| FoodCourt has 5456 (62.8%) zeros | FoodCourt has 5533 (63.6%) zeros | Zeros |
| ShoppingMall has 5587 (64.3%) zeros | ShoppingMall has 5692 (65.5%) zeros | Zeros |
| Spa has 5324 (61.2%) zeros | Spa has 5393 (62.0%) zeros | Zeros |
| VRDeck has 5495 (63.2%) zeros | VRDeck has 5576 (64.1%) zeros | Zeros |
| Alert not present in this dataset | Dataset has 504 (5.8%) duplicate rows | Duplicates |
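The imbalance percentages reported for VIP are consistent with a score of one minus the normalized Shannon entropy of the class shares. That this is how the report derives the figure is an assumption, checked here only against the two VIP values; the function name `imbalance_score` is hypothetical, and the class counts are taken from the VIP common-values tables (missing values excluded).

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the base-2 Shannon entropy of the
    class shares and k is the number of classes (0 = balanced, 1 = one class)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# VIP class counts (False, True) from the common-values tables.
vip_before = [8291, 199]
vip_after = [8495, 198]

score_before = round(100 * imbalance_score(vip_before), 1)
score_after = round(100 * imbalance_score(vip_after), 1)
print(score_before, score_after)
```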
Reproduction
| | Before imputation | After imputation |
|---|---|---|
| Analysis started | 2024-04-23 18:25:49.887940 | 2024-04-23 18:26:06.761971 |
| Analysis finished | 2024-04-23 18:26:06.749186 | 2024-04-23 18:26:19.481524 |
| Duration | 16.86 seconds | 12.72 seconds |
| Software version | ydata-profiling v4.7.0 | ydata-profiling v4.7.0 |
| Download configuration | config.json | config.json |
HomePlanet
Categorical
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 201 | 0 |
| Missing (%) | 2.3% | 0.0% |
| Memory size | 68.0 KiB | 68.0 KiB |
Frequency bar charts (before and after imputation): Earth, Europa, Mars
Length
| | Before imputation | After imputation |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 5 | 5 |
| Mean length | 5.0438059 | 5.0421028 |
| Min length | 4 | 4 |
Characters and Unicode
| | Before imputation | After imputation |
|---|---|---|
| Total characters | 42832 | 43831 |
| Distinct characters | 10 | 10 |
| Distinct categories | 1 | 1 |
| Distinct scripts | 1 | 1 |
| Distinct blocks | 1 | 1 |
Unique
| | Before imputation | After imputation |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Before imputation | After imputation |
|---|---|---|
| 1st row | Europa | Europa |
| 2nd row | Earth | Earth |
| 3rd row | Europa | Europa |
| 4th row | Europa | Europa |
| 5th row | Earth | Earth |
Common Values
| Value | Count | Frequency (%) |
| Earth | 4602 | |
| Europa | 2131 | |
| Mars | 1759 | 20.2% |
| (Missing) | 201 | 2.3% |
| Value | Count | Frequency (%) |
| Earth | 4709 | |
| Europa | 2175 | |
| Mars | 1809 | 20.8% |
Length and Common Values (Plot): charts for before and after imputation
Words
| Value | Count | Frequency (%) |
| earth | 4602 | |
| europa | 2131 | |
| mars | 1759 | 20.7% |
| Value | Count | Frequency (%) |
| earth | 4709 | |
| europa | 2175 | |
| mars | 1809 | 20.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
| Value | Count | Frequency (%) |
| a | 8693 | |
| r | 8693 | |
| E | 6884 | |
| t | 4709 | |
| h | 4709 | |
| u | 2175 | 5.0% |
| o | 2175 | 5.0% |
| p | 2175 | 5.0% |
| M | 1809 | 4.1% |
| s | 1809 | 4.1% |
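The character counts, total characters, and mean length for HomePlanet follow directly from the category counts. The sketch below reproduces the "Before imputation" figures from the common-values table; it illustrates the arithmetic and is not the profiler's implementation, and the function name `character_stats` is hypothetical.

```python
from collections import Counter

# Category counts for HomePlanet before imputation (missing values excluded).
home_planet_before = {"Earth": 4602, "Europa": 2131, "Mars": 1759}


def character_stats(value_counts):
    """Weight each character of each category by that category's row count."""
    chars = Counter()
    for value, n in value_counts.items():
        for ch in value:
            chars[ch] += n
    total_chars = sum(chars.values())
    mean_length = total_chars / sum(value_counts.values())
    return chars, total_chars, mean_length


chars, total_chars, mean_length = character_stats(home_planet_before)
print(total_chars, len(chars), round(mean_length, 7))
print(chars.most_common(3))
```

Every category name contains exactly one "a" and one "r", which is why both characters appear 8492 times, once per non-missing row.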
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 42832 |
| Value | Count | Frequency (%) |
| (unknown) | 43831 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
| Value | Count | Frequency (%) |
| a | 8693 | |
| r | 8693 | |
| E | 6884 | |
| t | 4709 | |
| h | 4709 | |
| u | 2175 | 5.0% |
| o | 2175 | 5.0% |
| p | 2175 | 5.0% |
| M | 1809 | 4.1% |
| s | 1809 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 42832 |
| Value | Count | Frequency (%) |
| (unknown) | 43831 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
| Value | Count | Frequency (%) |
| a | 8693 | |
| r | 8693 | |
| E | 6884 | |
| t | 4709 | |
| h | 4709 | |
| u | 2175 | 5.0% |
| o | 2175 | 5.0% |
| p | 2175 | 5.0% |
| M | 1809 | 4.1% |
| s | 1809 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 42832 |
| Value | Count | Frequency (%) |
| (unknown) | 43831 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
| Value | Count | Frequency (%) |
| a | 8693 | |
| r | 8693 | |
| E | 6884 | |
| t | 4709 | |
| h | 4709 | |
| u | 2175 | 5.0% |
| o | 2175 | 5.0% |
| p | 2175 | 5.0% |
| M | 1809 | 4.1% |
| s | 1809 | 4.1% |
CryoSleep
Boolean
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 217 | 0 |
| Missing (%) | 2.5% | 0.0% |
| Memory size | 68.0 KiB | 8.6 KiB |
Frequency bar charts (before and after imputation): False, True, (Missing)
| Value | Count | Frequency (%) |
| False | 5439 | |
| True | 3037 | |
| (Missing) | 217 | 2.5% |
| Value | Count | Frequency (%) |
| False | 5571 | |
| True | 3122 |
Destination
Categorical
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 182 | 0 |
| Missing (%) | 2.1% | 0.0% |
| Memory size | 68.0 KiB | 68.0 KiB |
Frequency bar charts (before and after imputation): TRAPPIST-1e, 55 Cancri e, PSO J318.5-22
Length
| | Before imputation | After imputation |
|---|---|---|
| Max length | 13 | 13 |
| Median length | 11 | 11 |
| Mean length | 11.187052 | 11.183136 |
| Min length | 11 | 11 |
Characters and Unicode
| | Before imputation | After imputation |
|---|---|---|
| Total characters | 95213 | 97215 |
| Distinct characters | 23 | 23 |
| Distinct categories | 1 | 1 |
| Distinct scripts | 1 | 1 |
| Distinct blocks | 1 | 1 |
Unique
| | Before imputation | After imputation |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Before imputation | After imputation |
|---|---|---|
| 1st row | TRAPPIST-1e | TRAPPIST-1e |
| 2nd row | TRAPPIST-1e | TRAPPIST-1e |
| 3rd row | TRAPPIST-1e | TRAPPIST-1e |
| 4th row | TRAPPIST-1e | TRAPPIST-1e |
| 5th row | TRAPPIST-1e | TRAPPIST-1e |
Common Values
| Value | Count | Frequency (%) |
| TRAPPIST-1e | 5915 | |
| 55 Cancri e | 1800 | 20.7% |
| PSO J318.5-22 | 796 | 9.2% |
| (Missing) | 182 | 2.1% |
| Value | Count | Frequency (%) |
| TRAPPIST-1e | 6086 | |
| 55 Cancri e | 1811 | 20.8% |
| PSO J318.5-22 | 796 | 9.2% |
Length and Common Values (Plot): charts for before and after imputation
Words
| Value | Count | Frequency (%) |
| trappist-1e | 5915 | |
| 55 | 1800 | 13.9% |
| cancri | 1800 | 13.9% |
| e | 1800 | 13.9% |
| pso | 796 | 6.2% |
| j318.5-22 | 796 | 6.2% |
| Value | Count | Frequency (%) |
| trappist-1e | 6086 | |
| 55 | 1811 | 13.8% |
| cancri | 1811 | 13.8% |
| e | 1811 | 13.8% |
| pso | 796 | 6.1% |
| j318.5-22 | 796 | 6.1% |
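The lowercase token tables above can be reproduced by lowercasing each Destination value and splitting it on whitespace, with frequencies taken over the total number of tokens rather than rows. Whether the report's tokenizer works exactly this way is an assumption; this sketch only shows that simple whitespace splitting matches the "Before imputation" numbers. The function name `word_counts` is hypothetical.

```python
from collections import Counter

# Category counts for Destination before imputation (missing values excluded).
destination_before = {"TRAPPIST-1e": 5915, "55 Cancri e": 1800, "PSO J318.5-22": 796}


def word_counts(value_counts):
    """Lowercase each value, split on whitespace, and weight tokens by row count."""
    words = Counter()
    for value, n in value_counts.items():
        for token in value.lower().split():
            words[token] += n
    return words


words = word_counts(destination_before)
total_tokens = sum(words.values())

# Shares are relative to all tokens, so a three-word value counts three times.
share_cancri = round(100 * words["cancri"] / total_tokens, 1)
share_pso = round(100 * words["pso"] / total_tokens, 1)
print(total_tokens, share_cancri, share_pso)
```

This is why "55 Cancri e" shows 20.7% in the common-values table but each of its three tokens shows 13.9% here.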
Most occurring characters
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| S | 6711 | 7.0% |
| - | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| A | 5915 | 6.2% |
| I | 5915 | 6.2% |
| R | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
| Value | Count | Frequency (%) |
| P | 12968 | |
| T | 12172 | |
| e | 7897 | 8.1% |
| S | 6882 | 7.1% |
| - | 6882 | 7.1% |
| 1 | 6882 | 7.1% |
| A | 6086 | 6.3% |
| I | 6086 | 6.3% |
| R | 6086 | 6.3% |
| 5 | 4418 | 4.5% |
| Other values (13) | 20856 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 95213 |
| Value | Count | Frequency (%) |
| (unknown) | 97215 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| S | 6711 | 7.0% |
| - | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| A | 5915 | 6.2% |
| I | 5915 | 6.2% |
| R | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
| Value | Count | Frequency (%) |
| P | 12968 | |
| T | 12172 | |
| e | 7897 | 8.1% |
| S | 6882 | 7.1% |
| - | 6882 | 7.1% |
| 1 | 6882 | 7.1% |
| A | 6086 | 6.3% |
| I | 6086 | 6.3% |
| R | 6086 | 6.3% |
| 5 | 4418 | 4.5% |
| Other values (13) | 20856 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 95213 |
| Value | Count | Frequency (%) |
| (unknown) | 97215 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| S | 6711 | 7.0% |
| - | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| A | 5915 | 6.2% |
| I | 5915 | 6.2% |
| R | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
| Value | Count | Frequency (%) |
| P | 12968 | |
| T | 12172 | |
| e | 7897 | 8.1% |
| S | 6882 | 7.1% |
| - | 6882 | 7.1% |
| 1 | 6882 | 7.1% |
| A | 6086 | 6.3% |
| I | 6086 | 6.3% |
| R | 6086 | 6.3% |
| 5 | 4418 | 4.5% |
| Other values (13) | 20856 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 95213 |
| Value | Count | Frequency (%) |
| (unknown) | 97215 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| S | 6711 | 7.0% |
| - | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| A | 5915 | 6.2% |
| I | 5915 | 6.2% |
| R | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
| Value | Count | Frequency (%) |
| P | 12968 | |
| T | 12172 | |
| e | 7897 | 8.1% |
| S | 6882 | 7.1% |
| - | 6882 | 7.1% |
| 1 | 6882 | 7.1% |
| A | 6086 | 6.3% |
| I | 6086 | 6.3% |
| R | 6086 | 6.3% |
| 5 | 4418 | 4.5% |
| Other values (13) | 20856 |
Age
Real number (ℝ)
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 80 | 204 |
| Distinct (%) | 0.9% | 2.3% |
| Missing | 179 | 0 |
| Missing (%) | 2.1% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 28.82793 | 28.83621 |
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 79 | 79 |
| Zeros | 178 | 178 |
| Zeros (%) | 2.0% | 2.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 68.0 KiB | 68.0 KiB |
Quantile statistics
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 4 | 4 |
| Q1 | 19 | 20 |
| median | 27 | 27 |
| Q3 | 38 | 37 |
| 95-th percentile | 56 | 55 |
| Maximum | 79 | 79 |
| Range | 79 | 79 |
| Interquartile range (IQR) | 19 | 17 |
Descriptive statistics
| | Before imputation | After imputation |
|---|---|---|
| Standard deviation | 14.489021 | 14.361932 |
| Coefficient of variation (CV) | 0.50260359 | 0.49805199 |
| Kurtosis | 0.10193292 | 0.14809039 |
| Mean | 28.82793 | 28.83621 |
| Median Absolute Deviation (MAD) | 9 | 9 |
| Skewness | 0.41909658 | 0.41898548 |
| Sum | 245441 | 250673.17 |
| Variance | 209.93174 | 206.26508 |
| Monotonicity | Not monotonic | Not monotonic |
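Several of the Age statistics are related by simple identities: the mean is the sum over the non-missing count, the coefficient of variation is the standard deviation over the mean, the variance is the squared standard deviation, and the IQR is Q3 minus Q1. The sketch below checks the "Before imputation" column against those identities, using only values copied from the tables above.

```python
# Values copied from the Age tables (before imputation).
n_rows = 8693
missing_before = 179
sum_before = 245441
std_before = 14.489021
q1_before, q3_before = 19, 38

# Statistics are computed over non-missing observations only.
count_before = n_rows - missing_before
mean_before = sum_before / count_before

cv_before = std_before / mean_before      # coefficient of variation
variance_before = std_before ** 2         # variance as squared std
iqr_before = q3_before - q1_before        # interquartile range

print(count_before, round(mean_before, 5), round(cv_before, 8))
print(round(variance_before, 5), iqr_before)
```

Small differences in the last digit of the variance come from squaring an already rounded standard deviation.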
| Value | Count | Frequency (%) |
| 24 | 324 | 3.7% |
| 18 | 320 | 3.7% |
| 21 | 311 | 3.6% |
| 19 | 293 | 3.4% |
| 23 | 292 | 3.4% |
| 22 | 291 | 3.3% |
| 20 | 277 | 3.2% |
| 26 | 268 | 3.1% |
| 28 | 267 | 3.1% |
| 27 | 259 | 3.0% |
| Other values (70) | 5612 |
| Value | Count | Frequency (%) |
| 24 | 324 | 3.7% |
| 18 | 320 | 3.7% |
| 21 | 311 | 3.6% |
| 19 | 293 | 3.4% |
| 23 | 292 | 3.4% |
| 22 | 291 | 3.3% |
| 20 | 277 | 3.2% |
| 26 | 268 | 3.1% |
| 28 | 267 | 3.1% |
| 27 | 259 | 3.0% |
| Other values (194) | 5791 |
| Value | Count | Frequency (%) |
| 0 | 178 | |
| 1 | 67 | 0.8% |
| 2 | 75 | |
| 3 | 75 | |
| 4 | 71 | 0.8% |
| 5 | 33 | 0.4% |
| 6 | 40 | 0.5% |
| 7 | 52 | 0.6% |
| 8 | 46 | 0.5% |
| 9 | 42 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 178 | |
| 1 | 67 | 0.8% |
| 2 | 75 | |
| 3 | 75 | |
| 4 | 71 | 0.8% |
| 5 | 33 | 0.4% |
| 6 | 40 | 0.5% |
| 7 | 52 | 0.6% |
| 8 | 46 | 0.5% |
| 8.451165929 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 178 | |
| 1 | 67 | 0.8% |
| 2 | 75 | |
| 3 | 75 | |
| 4 | 71 | 0.8% |
| 5 | 33 | 0.4% |
| 6 | 40 | 0.5% |
| 7 | 52 | 0.6% |
| 8 | 46 | 0.5% |
| 8.451165929 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 178 | |
| 1 | 67 | 0.8% |
| 2 | 75 | |
| 3 | 75 | |
| 4 | 71 | 0.8% |
| 5 | 33 | 0.4% |
| 6 | 40 | 0.5% |
| 7 | 52 | 0.6% |
| 8 | 46 | 0.5% |
| 9 | 42 | 0.5% |
VIP
Boolean
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 203 | 0 |
| Missing (%) | 2.3% | 0.0% |
| Memory size | 68.0 KiB | 8.6 KiB |
Frequency bar charts (before and after imputation): False, True, (Missing)
| Value | Count | Frequency (%) |
| False | 8291 | |
| True | 199 | 2.3% |
| (Missing) | 203 | 2.3% |
| Value | Count | Frequency (%) |
| False | 8495 | |
| True | 198 | 2.3% |
RoomService
Real number (ℝ)
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 1273 | 1380 |
| Distinct (%) | 15.0% | 15.9% |
| Missing | 181 | 0 |
| Missing (%) | 2.1% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 224.68762 | 224.53839 |
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -232.32033 |
| Maximum | 14327 | 14327 |
| Zeros | 5577 | 5651 |
| Zeros (%) | 64.2% | 65.0% |
| Negative | 0 | 10 |
| Negative (%) | 0.0% | 0.1% |
| Memory size | 68.0 KiB | 68.0 KiB |
Quantile statistics
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -232.32033 |
| 5-th percentile | 0 | 0 |
| Q1 | 0 | 0 |
| median | 0 | 0 |
| Q3 | 47 | 56 |
| 95-th percentile | 1274.25 | 1267.4 |
| Maximum | 14327 | 14327 |
| Range | 14327 | 14559.32 |
| Interquartile range (IQR) | 47 | 56 |
Descriptive statistics
| | Before imputation | After imputation |
|---|---|---|
| Standard deviation | 666.71766 | 661.83779 |
| Coefficient of variation (CV) | 2.9673093 | 2.9475485 |
| Kurtosis | 65.273802 | 65.865938 |
| Mean | 224.68762 | 224.53839 |
| Median Absolute Deviation (MAD) | 0 | 0 |
| Skewness | 6.3330141 | 6.3474445 |
| Sum | 1912541 | 1951912.2 |
| Variance | 444512.44 | 438029.26 |
| Monotonicity | Not monotonic | Not monotonic |
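The RoomService figures also show what the imputation contributed. Since the non-missing rows are unchanged, the difference between the two sums is the total of the 181 imputed values, and the new range follows from the negative minimum. The sketch below uses only values copied from the tables above; the after-imputation sum is rounded to one decimal in the report, so the derived mean is approximate.

```python
# Values copied from the RoomService tables.
sum_before = 1912541
sum_after = 1951912.2          # rounded to one decimal in the report
n_imputed = 181                # missing values before imputation
min_after = -232.32033
max_after = 14327

# Total and mean of the values filled in by the imputation.
imputed_total = sum_after - sum_before
imputed_mean = imputed_total / n_imputed

# Range after imputation: maximum minus the (now negative) minimum.
range_after = max_after - min_after

print(round(imputed_total, 1), round(imputed_mean, 2), round(range_after, 2))
```

The imputed values average roughly 217.5, close to the before-imputation mean of 224.69, while the ten negative values widen the range from 14327 to about 14559.32.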
| Value | Count | Frequency (%) |
| 0 | 5577 | |
| 1 | 117 | 1.3% |
| 2 | 79 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 9 | 25 | 0.3% |
| 8 | 24 | 0.3% |
| 6 | 24 | 0.3% |
| 14 | 21 | 0.2% |
| Other values (1263) | 2509 | |
| (Missing) | 181 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 5651 | |
| 1 | 117 | 1.3% |
| 2 | 79 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 9 | 25 | 0.3% |
| 8 | 24 | 0.3% |
| 6 | 24 | 0.3% |
| 14 | 21 | 0.2% |
| Other values (1370) | 2616 |
| Value | Count | Frequency (%) |
| 0 | 5577 | |
| 1 | 117 | 1.3% |
| 2 | 79 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 6 | 24 | 0.3% |
| 7 | 17 | 0.2% |
| 8 | 24 | 0.3% |
| 9 | 25 | 0.3% |
| Value | Count | Frequency (%) |
| -232.3203344 | 1 | |
| -95.69703395 | 1 | |
| -85.73948362 | 1 | |
| -69.5474572 | 1 | |
| -43.18417992 | 1 | |
| -27.72422618 | 1 | |
| -17.40291265 | 1 | |
| -15.59733242 | 1 | |
| -6.729029689 | 1 | |
| -1.542926493 | 1 |
| Value | Count | Frequency (%) |
| -232.3203344 | 1 | |
| -95.69703395 | 1 | |
| -85.73948362 | 1 | |
| -69.5474572 | 1 | |
| -43.18417992 | 1 | |
| -27.72422618 | 1 | |
| -17.40291265 | 1 | |
| -15.59733242 | 1 | |
| -6.729029689 | 1 | |
| -1.542926493 | 1 |
| Value | Count | Frequency (%) |
| 0 | 5577 | |
| 1 | 117 | 1.3% |
| 2 | 79 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 6 | 24 | 0.3% |
| 7 | 17 | 0.2% |
| 8 | 24 | 0.3% |
| 9 | 25 | 0.3% |
FoodCourt
Real number (ℝ)
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 1507 | 1613 |
| Distinct (%) | 17.7% | 18.6% |
| Missing | 183 | 0 |
| Missing (%) | 2.1% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 458.0772 | 456.12828 |
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -330.2559 |
| Maximum | 29813 | 29813 |
| Zeros | 5456 | 5533 |
| Zeros (%) | 62.8% | 63.6% |
| Negative | 0 | 9 |
| Negative (%) | 0.0% | 0.1% |
| Memory size | 68.0 KiB | 68.0 KiB |
Quantile statistics
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -330.2559 |
| 5-th percentile | 0 | 0 |
| Q1 | 0 | 0 |
| median | 0 | 0 |
| Q3 | 76 | 86 |
| 95-th percentile | 2748.5 | 2749.4679 |
| Maximum | 29813 | 29813 |
| Range | 29813 | 30143.256 |
| Interquartile range (IQR) | 76 | 86 |
Descriptive statistics
| | Before imputation | After imputation |
|---|---|---|
| Standard deviation | 1611.4892 | 1600.4771 |
| Coefficient of variation (CV) | 3.5179425 | 3.5088312 |
| Kurtosis | 73.30723 | 73.861567 |
| Mean | 458.0772 | 456.12828 |
| Median Absolute Deviation (MAD) | 0 | 0 |
| Skewness | 7.1022279 | 7.1170725 |
| Sum | 3898237 | 3965123.2 |
| Variance | 2596897.6 | 2561527 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5456 | |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 9 | 28 | 0.3% |
| 7 | 27 | 0.3% |
| 10 | 27 | 0.3% |
| Other values (1497) | 2611 | |
| (Missing) | 183 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 5533 | |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 9 | 28 | 0.3% |
| 7 | 27 | 0.3% |
| 10 | 27 | 0.3% |
| Other values (1603) | 2717 |
| Value | Count | Frequency (%) |
| 0 | 5456 | |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 7 | 27 | 0.3% |
| 8 | 20 | 0.2% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| -330.2559002 | 1 | < 0.1% |
| -252.2289842 | 1 | < 0.1% |
| -198.7968009 | 1 | < 0.1% |
| -160.326611 | 1 | < 0.1% |
| -84.50866249 | 1 | < 0.1% |
| -81.76534395 | 1 | < 0.1% |
| -30.4882525 | 1 | < 0.1% |
| -14.52259282 | 1 | < 0.1% |
| -8.169382772 | 1 | < 0.1% |
| 0 | 5533 |
| Value | Count | Frequency (%) |
| -330.2559002 | 1 | < 0.1% |
| -252.2289842 | 1 | < 0.1% |
| -198.7968009 | 1 | < 0.1% |
| -160.326611 | 1 | < 0.1% |
| -84.50866249 | 1 | < 0.1% |
| -81.76534395 | 1 | < 0.1% |
| -30.4882525 | 1 | < 0.1% |
| -14.52259282 | 1 | < 0.1% |
| -8.169382772 | 1 | < 0.1% |
| 0 | 5533 |
| Value | Count | Frequency (%) |
| 0 | 5456 | |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 7 | 27 | 0.3% |
| 8 | 20 | 0.2% |
| 9 | 28 | 0.3% |
ShoppingMall
Real number (ℝ)
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 1115 | 1218 |
| Distinct (%) | 13.1% | 14.0% |
| Missing | 208 | 0 |
| Missing (%) | 2.4% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 173.72917 | 173.10661 |
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -162.15064 |
| Maximum | 23492 | 23492 |
| Zeros | 5587 | 5692 |
| Zeros (%) | 64.3% | 65.5% |
| Negative | 0 | 10 |
| Negative (%) | 0.0% | 0.1% |
| Memory size | 68.0 KiB | 68.0 KiB |
Quantile statistics
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -162.15064 |
| 5-th percentile | 0 | 0 |
| Q1 | 0 | 0 |
| median | 0 | 0 |
| Q3 | 27 | 31 |
| 95-th percentile | 927.8 | 926 |
| Maximum | 23492 | 23492 |
| Range | 23492 | 23654.151 |
| Interquartile range (IQR) | 27 | 31 |
Descriptive statistics
| | Before imputation | After imputation |
|---|---|---|
| Standard deviation | 604.69646 | 599.20486 |
| Coefficient of variation (CV) | 3.4806847 | 3.4614788 |
| Kurtosis | 328.87091 | 333.04009 |
| Mean | 173.72917 | 173.10661 |
| Median Absolute Deviation (MAD) | 0 | 0 |
| Skewness | 12.627562 | 12.67925 |
| Sum | 1474092 | 1504815.8 |
| Variance | 365657.81 | 359046.46 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5587 | |
| 1 | 153 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 7 | 36 | 0.4% |
| 6 | 34 | 0.4% |
| 13 | 29 | 0.3% |
| 8 | 28 | 0.3% |
| Other values (1105) | 2396 | |
| (Missing) | 208 | 2.4% |
| Value | Count | Frequency (%) |
| 0 | 5692 | |
| 1 | 153 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 7 | 36 | 0.4% |
| 6 | 34 | 0.4% |
| 13 | 29 | 0.3% |
| 9 | 28 | 0.3% |
| Other values (1208) | 2499 |
| Value | Count | Frequency (%) |
| 0 | 5587 | |
| 1 | 153 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 6 | 34 | 0.4% |
| 7 | 36 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| -162.1506372 | 1 | |
| -137.3657573 | 1 | |
| -114.561574 | 1 | |
| -113.5403653 | 1 | |
| -64.68670373 | 1 | |
| -55.26010686 | 1 | |
| -54.86229795 | 1 | |
| -44.01136393 | 1 | |
| -29.99700539 | 1 | |
| -22.36626173 | 1 |
| Value | Count | Frequency (%) |
| -162.1506372 | 1 | |
| -137.3657573 | 1 | |
| -114.561574 | 1 | |
| -113.5403653 | 1 | |
| -64.68670373 | 1 | |
| -55.26010686 | 1 | |
| -54.86229795 | 1 | |
| -44.01136393 | 1 | |
| -29.99700539 | 1 | |
| -22.36626173 | 1 |
| Value | Count | Frequency (%) |
| 0 | 5587 | |
| 1 | 153 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 6 | 34 | 0.4% |
| 7 | 36 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
Spa
Real number (ℝ)
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 1327 | 1441 |
| Distinct (%) | 15.6% | 16.6% |
| Missing | 183 | 0 |
| Missing (%) | 2.1% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 311.13878 | 311.25812 |
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -67.375759 |
| Maximum | 22408 | 22408 |
| Zeros | 5324 | 5393 |
| Zeros (%) | 61.2% | 62.0% |
| Negative | 0 | 5 |
| Negative (%) | 0.0% | 0.1% |
| Memory size | 68.0 KiB | 68.0 KiB |
Quantile statistics
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -67.375759 |
| 5-th percentile | 0 | 0 |
| Q1 | 0 | 0 |
| median | 0 | 0 |
| Q3 | 59 | 71 |
| 95-th percentile | 1607.1 | 1611.4 |
| Maximum | 22408 | 22408 |
| Range | 22408 | 22475.376 |
| Interquartile range (IQR) | 59 | 71 |
Descriptive statistics
| | Before imputation | After imputation |
|---|---|---|
| Standard deviation | 1136.7055 | 1127.664 |
| Coefficient of variation (CV) | 3.6533715 | 3.6229224 |
| Kurtosis | 81.20211 | 82.114 |
| Mean | 311.13878 | 311.25812 |
| Median Absolute Deviation (MAD) | 0 | 0 |
| Skewness | 7.6360199 | 7.6630794 |
| Sum | 2647791 | 2705766.9 |
| Variance | 1292099.5 | 1271626.2 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5324 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 5 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 7 | 34 | 0.4% |
| 6 | 33 | 0.4% |
| 9 | 28 | 0.3% |
| 8 | 28 | 0.3% |
| Other values (1317) | 2660 | |
| (Missing) | 183 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 5393 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 5 | 53 | 0.6% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 7 | 34 | 0.4% |
| 6 | 33 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
| Other values (1431) | 2774 |
| Value | Count | Frequency (%) |
| 0 | 5324 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 5 | 53 | 0.6% |
| 6 | 33 | 0.4% |
| 7 | 34 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| -67.3757589 | 1 | < 0.1% |
| -10.48259565 | 1 | < 0.1% |
| -6.989195279 | 1 | < 0.1% |
| -6.506341041 | 1 | < 0.1% |
| -0.184145847 | 1 | < 0.1% |
| 0 | 5393 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| Value | Count | Frequency (%) |
| -67.3757589 | 1 | < 0.1% |
| -10.48259565 | 1 | < 0.1% |
| -6.989195279 | 1 | < 0.1% |
| -6.506341041 | 1 | < 0.1% |
| -0.184145847 | 1 | < 0.1% |
| 0 | 5393 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 5324 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 5 | 53 | 0.6% |
| 6 | 33 | 0.4% |
| 7 | 34 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
VRDeck
Real number (ℝ)
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 1306 | 1413 |
| Distinct (%) | 15.4% | 16.3% |
| Missing | 188 | 0 |
| Missing (%) | 2.2% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 304.85479 | 303.54529 |
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -129.47032 |
| Maximum | 24133 | 24133 |
| Zeros | 5495 | 5576 |
| Zeros (%) | 63.2% | 64.1% |
| Negative | 0 | 9 |
| Negative (%) | 0.0% | 0.1% |
| Memory size | 68.0 KiB | 68.0 KiB |
Quantile statistics
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 0 | -129.47032 |
| 5-th percentile | 0 | 0 |
| Q1 | 0 | 0 |
| median | 0 | 0 |
| Q3 | 46 | 52.407262 |
| 95-th percentile | 1534.2 | 1514 |
| Maximum | 24133 | 24133 |
| Range | 24133 | 24262.47 |
| Interquartile range (IQR) | 46 | 52.407262 |
Descriptive statistics
| | Before imputation | After imputation |
|---|---|---|
| Standard deviation | 1145.7172 | 1136.936 |
| Coefficient of variation (CV) | 3.7582391 | 3.7455236 |
| Kurtosis | 86.011186 | 86.886046 |
| Mean | 304.85479 | 303.54529 |
| Median Absolute Deviation (MAD) | 0 | 0 |
| Skewness | 7.8197316 | 7.8470745 |
| Sum | 2592790 | 2638719.2 |
| Variance | 1312667.9 | 1292623.6 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5495 | |
| 1 | 139 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 5 | 51 | 0.6% |
| 4 | 47 | 0.5% |
| 6 | 32 | 0.4% |
| 8 | 30 | 0.3% |
| 7 | 29 | 0.3% |
| 9 | 25 | 0.3% |
| Other values (1296) | 2531 | |
| (Missing) | 188 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 5576 | |
| 1 | 139 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 5 | 51 | 0.6% |
| 4 | 47 | 0.5% |
| 6 | 32 | 0.4% |
| 8 | 30 | 0.3% |
| 7 | 29 | 0.3% |
| 9 | 25 | 0.3% |
| Other values (1403) | 2638 |
| Value | Count | Frequency (%) |
| 0 | 5495 | |
| 1 | 139 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 4 | 47 | 0.5% |
| 5 | 51 | 0.6% |
| 6 | 32 | 0.4% |
| 7 | 29 | 0.3% |
| 8 | 30 | 0.3% |
| 9 | 25 | 0.3% |
| Value | Count | Frequency (%) |
| -129.4703202 | 1 | < 0.1% |
| -123.0460894 | 1 | < 0.1% |
| -102.4264273 | 1 | < 0.1% |
| -72.22024974 | 1 | < 0.1% |
| -71.60520892 | 1 | < 0.1% |
| -37.93002098 | 1 | < 0.1% |
| -29.98420841 | 1 | < 0.1% |
| -23.63495899 | 1 | < 0.1% |
| -6.684265716 | 1 | < 0.1% |
| 0 | 5576 |
| Value | Count | Frequency (%) |
| -129.4703202 | 1 | < 0.1% |
| -123.0460894 | 1 | < 0.1% |
| -102.4264273 | 1 | < 0.1% |
| -72.22024974 | 1 | < 0.1% |
| -71.60520892 | 1 | < 0.1% |
| -37.93002098 | 1 | < 0.1% |
| -29.98420841 | 1 | < 0.1% |
| -23.63495899 | 1 | < 0.1% |
| -6.684265716 | 1 | < 0.1% |
| 0 | 5576 |
| Value | Count | Frequency (%) |
| 0 | 5495 | |
| 1 | 139 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 4 | 47 | 0.5% |
| 5 | 51 | 0.6% |
| 6 | 32 | 0.4% |
| 7 | 29 | 0.3% |
| 8 | 30 | 0.3% |
| 9 | 25 | 0.3% |
Transported
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 KiB |
Frequency bar chart: 1, 0
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8693 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 4378 | |
| 0 | 4315 |
Length
| Value | Count | Frequency (%) |
| 1 | 4378 | |
| 0 | 4315 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4378 | |
| 0 | 4315 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8693 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4378 | |
| 0 | 4315 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8693 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4378 | |
| 0 | 4315 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8693 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4378 | |
| 0 | 4315 |
Cabin_deck
Categorical
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 8 | 8 |
| Distinct (%) | 0.1% | 0.1% |
| Missing | 199 | 0 |
| Missing (%) | 2.3% | 0.0% |
| Memory size | 68.0 KiB | 68.0 KiB |
Frequency bar charts (before and after imputation): F, G, E, B, C, Other values (3)
Length
| | Before imputation | After imputation |
|---|---|---|
| Max length | 1 | 1 |
| Median length | 1 | 1 |
| Mean length | 1 | 1 |
| Min length | 1 | 1 |
Characters and Unicode
| | Before imputation | After imputation |
|---|---|---|
| Total characters | 8494 | 8693 |
| Distinct characters | 8 | 8 |
| Distinct categories | 1 | 1 |
| Distinct scripts | 1 | 1 |
| Distinct blocks | 1 | 1 |
Unique
| | Before imputation | After imputation |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Before imputation | After imputation |
|---|---|---|
| 1st row | B | B |
| 2nd row | F | F |
| 3rd row | A | A |
| 4th row | A | A |
| 5th row | F | F |
Common Values
| Value | Count | Frequency (%) |
| F | 2794 | |
| G | 2559 | |
| E | 876 | 10.1% |
| B | 779 | 9.0% |
| C | 747 | 8.6% |
| D | 478 | 5.5% |
| A | 256 | 2.9% |
| T | 5 | 0.1% |
| (Missing) | 199 | 2.3% |
| Value | Count | Frequency (%) |
| F | 2868 | |
| G | 2615 | |
| E | 877 | 10.1% |
| B | 819 | 9.4% |
| C | 768 | 8.8% |
| D | 483 | 5.6% |
| A | 258 | 3.0% |
| T | 5 | 0.1% |
Length and Common Values (Plot): charts for before and after imputation
Words
| Value | Count | Frequency (%) |
| f | 2794 | |
| g | 2559 | |
| e | 876 | 10.3% |
| b | 779 | 9.2% |
| c | 747 | 8.8% |
| d | 478 | 5.6% |
| a | 256 | 3.0% |
| t | 5 | 0.1% |
| Value | Count | Frequency (%) |
| f | 2868 | |
| g | 2615 | |
| e | 877 | 10.1% |
| b | 819 | 9.4% |
| c | 768 | 8.8% |
| d | 483 | 5.6% |
| a | 258 | 3.0% |
| t | 5 | 0.1% |
Most occurring characters
Before imputation
| Value | Count | Frequency (%) |
|---|---|---|
| F | 2794 | 32.9% |
| G | 2559 | 30.1% |
| E | 876 | 10.3% |
| B | 779 | 9.2% |
| C | 747 | 8.8% |
| D | 478 | 5.6% |
| A | 256 | 3.0% |
| T | 5 | 0.1% |
After imputation
| Value | Count | Frequency (%) |
|---|---|---|
| F | 2868 | 33.0% |
| G | 2615 | 30.1% |
| E | 877 | 10.1% |
| B | 819 | 9.4% |
| C | 768 | 8.8% |
| D | 483 | 5.6% |
| A | 258 | 3.0% |
| T | 5 | 0.1% |
Most occurring categories, scripts, and blocks
| Value | Count (before imputation) | Count (after imputation) |
|---|---|---|
| (unknown) | 8494 | 8693 |
Most frequent character per category, script, and block
All characters fall into a single (unknown) category, script, and block, so each of these tables is identical to Most occurring characters.
Cabin_side
Categorical
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 199 | 0 |
| Missing (%) | 2.3% | 0.0% |
| Memory size | 68.0 KiB | 68.0 KiB |
Length
| | Before imputation | After imputation |
|---|---|---|
| Max length | 1 | 1 |
| Median length | 1 | 1 |
| Mean length | 1 | 1 |
| Min length | 1 | 1 |
Characters and Unicode
| | Before imputation | After imputation |
|---|---|---|
| Total characters | 8494 | 8693 |
| Distinct characters | 2 | 2 |
| Distinct categories | 1 | 1 |
| Distinct scripts | 1 | 1 |
| Distinct blocks | 1 | 1 |
Unique
| | Before imputation | After imputation |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Before imputation | After imputation |
|---|---|---|
| 1st row | P | P |
| 2nd row | S | S |
| 3rd row | S | S |
| 4th row | S | S |
| 5th row | S | S |
Common Values
Before imputation
| Value | Count | Frequency (%) |
|---|---|---|
| S | 4288 | 49.3% |
| P | 4206 | 48.4% |
| (Missing) | 199 | 2.3% |
After imputation
| Value | Count | Frequency (%) |
|---|---|---|
| S | 4387 | 50.5% |
| P | 4306 | 49.5% |
Length and Common Values (Plot): plots omitted.
Before imputation
| Value | Count | Frequency (%) |
|---|---|---|
| s | 4288 | 50.5% |
| p | 4206 | 49.5% |
After imputation
| Value | Count | Frequency (%) |
|---|---|---|
| s | 4387 | 50.5% |
| p | 4306 | 49.5% |
Most occurring characters
Before imputation
| Value | Count | Frequency (%) |
|---|---|---|
| S | 4288 | 50.5% |
| P | 4206 | 49.5% |
After imputation
| Value | Count | Frequency (%) |
|---|---|---|
| S | 4387 | 50.5% |
| P | 4306 | 49.5% |
Most occurring categories, scripts, and blocks
| Value | Count (before imputation) | Count (after imputation) |
|---|---|---|
| (unknown) | 8494 | 8693 |
Most frequent character per category, script, and block
All characters fall into a single (unknown) category, script, and block, so each of these tables is identical to Most occurring characters.
ID_group
Real number (ℝ)
| Statistic | Value |
|---|---|
| Distinct | 6217 |
| Distinct (%) | 71.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4633.3896 |
| Minimum | 1 |
| Maximum | 9280 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 1 |
| 5-th percentile | 465.6 |
| Q1 | 2319 |
| median | 4630 |
| Q3 | 6883 |
| 95-th percentile | 8819.4 |
| Maximum | 9280 |
| Range | 9279 |
| Interquartile range (IQR) | 4564 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 2671.0289 |
| Coefficient of variation (CV) | 0.57647404 |
| Kurtosis | -1.1817463 |
| Mean | 4633.3896 |
| Median Absolute Deviation (MAD) | 2277 |
| Skewness | 0.0020202219 |
| Sum | 40278056 |
| Variance | 7134395.1 |
| Monotonicity | Increasing |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 4498 | 8 | 0.1% |
| 8168 | 8 | 0.1% |
| 8728 | 8 | 0.1% |
| 8796 | 8 | 0.1% |
| 8956 | 8 | 0.1% |
| 4256 | 8 | 0.1% |
| 984 | 8 | 0.1% |
| 9081 | 8 | 0.1% |
| 8988 | 8 | 0.1% |
| 5756 | 8 | 0.1% |
| Other values (6207) | 8613 | 99.1% |
Lowest values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
Highest values
| Value | Count | Frequency (%) |
|---|---|---|
| 9280 | 2 | < 0.1% |
| 9279 | 1 | < 0.1% |
| 9278 | 1 | < 0.1% |
| 9276 | 1 | < 0.1% |
| 9275 | 3 | < 0.1% |
| 9274 | 1 | < 0.1% |
| 9272 | 2 | < 0.1% |
| 9270 | 1 | < 0.1% |
| 9268 | 1 | < 0.1% |
| 9267 | 2 | < 0.1% |
ID_num
Real number (ℝ)
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 8 | 8 |
| Distinct (%) | 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 1.5177729 | 1.5177729 |
| Minimum | 1 | 1 |
| Maximum | 8 | 8 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 68.0 KiB | 68.0 KiB |
Quantile statistics
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1 | 1 |
| Q1 | 1 | 1 |
| median | 1 | 1 |
| Q3 | 2 | 2 |
| 95-th percentile | 4 | 4 |
| Maximum | 8 | 8 |
| Range | 7 | 7 |
| Interquartile range (IQR) | 1 | 1 |
Descriptive statistics
| | Before imputation | After imputation |
|---|---|---|
| Standard deviation | 1.0542413 | 1.0542413 |
| Coefficient of variation (CV) | 0.69459753 | 0.69459753 |
| Kurtosis | 8.7092628 | 8.7092628 |
| Mean | 1.5177729 | 1.5177729 |
| Median Absolute Deviation (MAD) | 0 | 0 |
| Skewness | 2.7466168 | 2.7466168 |
| Sum | 13194 | 13194 |
| Variance | 1.1114248 | 1.1114248 |
| Monotonicity | Not monotonic | Not monotonic |
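The ID_num descriptive statistics can be reproduced from the value counts reported for this variable. The following is a minimal sketch using only the standard library. It assumes the variance is the sample variance (dividing by n - 1), which is the convention that matches the reported figure, and that the coefficient of variation is the standard deviation divided by the mean.

```python
import math

# ID_num value counts as reported in this section (value -> count).
counts = {1: 6217, 2: 1412, 3: 571, 4: 231, 5: 128, 6: 75, 7: 46, 8: 13}

n = sum(counts.values())                             # number of observations
total = sum(value * c for value, c in counts.items())  # reported as "Sum"
mean = total / n

# Sample variance (ddof = 1); assumed convention, see the lead-in.
variance = sum(c * (value - mean) ** 2 for value, c in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"n={n} sum={total} mean={mean:.7f}")
print(f"variance={variance:.7f} std={std:.7f} cv={cv:.8f}")
```

Because ID_num has no missing values, imputation leaves these statistics unchanged, which is why both columns of the table agree.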
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 6217 | 71.5% |
| 2 | 1412 | 16.2% |
| 3 | 571 | 6.6% |
| 4 | 231 | 2.7% |
| 5 | 128 | 1.5% |
| 6 | 75 | 0.9% |
| 7 | 46 | 0.5% |
| 8 | 13 | 0.1% |
The counts are identical before and after imputation. With only 8 distinct values, the lowest-value and highest-value tables list the same rows.
Group_size
Real number (ℝ)
| | Before imputation | After imputation |
|---|---|---|
| Distinct | 8 | 8 |
| Distinct (%) | 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 2.0355458 | 2.0355458 |
| Minimum | 1 | 1 |
| Maximum | 8 | 8 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 68.0 KiB | 68.0 KiB |
Quantile statistics
| | Before imputation | After imputation |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1 | 1 |
| Q1 | 1 | 1 |
| median | 1 | 1 |
| Q3 | 3 | 3 |
| 95-th percentile | 6 | 6 |
| Maximum | 8 | 8 |
| Range | 7 | 7 |
| Interquartile range (IQR) | 2 | 2 |
Descriptive statistics
| | Before imputation | After imputation |
|---|---|---|
| Standard deviation | 1.5963465 | 1.5963465 |
| Coefficient of variation (CV) | 0.78423511 | 0.78423511 |
| Kurtosis | 3.1670958 | 3.1670958 |
| Mean | 2.0355458 | 2.0355458 |
| Median Absolute Deviation (MAD) | 0 | 0 |
| Skewness | 1.8890173 | 1.8890173 |
| Sum | 17695 | 17695 |
| Variance | 2.5483222 | 2.5483222 |
| Monotonicity | Not monotonic | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 4805 | 55.3% |
| 2 | 1682 | 19.3% |
| 3 | 1020 | 11.7% |
| 4 | 412 | 4.7% |
| 5 | 265 | 3.0% |
| 7 | 231 | 2.7% |
| 6 | 174 | 2.0% |
| 8 | 104 | 1.2% |
The counts are identical before and after imputation. With only 8 distinct values, the lowest-value and highest-value tables list the same rows, ordered by value.
Interaction plots (before and after imputation) omitted. For some variable pairs the report states "Interaction plot not present for dataset".
Before imputation
| | HomePlanet | CryoSleep | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Transported | Cabin_deck | Cabin_side | ID_group | ID_num | Group_size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Europa | False | TRAPPIST-1e | 39.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | B | P | 1 | 1 | 1 |
| 1 | Earth | False | TRAPPIST-1e | 24.0 | False | 109.0 | 9.0 | 25.0 | 549.0 | 44.0 | 1 | F | S | 2 | 1 | 1 |
| 2 | Europa | False | TRAPPIST-1e | 58.0 | True | 43.0 | 3576.0 | 0.0 | 6715.0 | 49.0 | 0 | A | S | 3 | 1 | 2 |
| 3 | Europa | False | TRAPPIST-1e | 33.0 | False | 0.0 | 1283.0 | 371.0 | 3329.0 | 193.0 | 0 | A | S | 3 | 2 | 2 |
| 4 | Earth | False | TRAPPIST-1e | 16.0 | False | 303.0 | 70.0 | 151.0 | 565.0 | 2.0 | 1 | F | S | 4 | 1 | 1 |
| 5 | Earth | False | PSO J318.5-22 | 44.0 | False | 0.0 | 483.0 | 0.0 | 291.0 | 0.0 | 1 | F | P | 5 | 1 | 1 |
| 6 | Earth | False | TRAPPIST-1e | 26.0 | False | 42.0 | 1539.0 | 3.0 | 0.0 | 0.0 | 1 | F | S | 6 | 1 | 2 |
| 7 | Earth | True | TRAPPIST-1e | 28.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | NaN | 1 | G | S | 6 | 2 | 2 |
| 8 | Earth | False | TRAPPIST-1e | 35.0 | False | 0.0 | 785.0 | 17.0 | 216.0 | 0.0 | 1 | F | S | 7 | 1 | 1 |
| 9 | Europa | True | 55 Cancri e | 14.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1 | B | P | 8 | 1 | 3 |
After imputation
| | HomePlanet | CryoSleep | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Cabin_deck | Cabin_side | ID_num | Group_size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Europa | False | TRAPPIST-1e | 39.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | B | P | 1 | 1 |
| 1 | Earth | False | TRAPPIST-1e | 24.0 | False | 109.0 | 9.0 | 25.0 | 549.0 | 44.0 | F | S | 1 | 1 |
| 2 | Europa | False | TRAPPIST-1e | 58.0 | True | 43.0 | 3576.0 | 0.0 | 6715.0 | 49.0 | A | S | 1 | 2 |
| 3 | Europa | False | TRAPPIST-1e | 33.0 | False | 0.0 | 1283.0 | 371.0 | 3329.0 | 193.0 | A | S | 2 | 2 |
| 4 | Earth | False | TRAPPIST-1e | 16.0 | False | 303.0 | 70.0 | 151.0 | 565.0 | 2.0 | F | S | 1 | 1 |
| 5 | Earth | False | PSO J318.5-22 | 44.0 | False | 0.0 | 483.0 | 0.0 | 291.0 | 0.0 | F | P | 1 | 1 |
| 6 | Earth | False | TRAPPIST-1e | 26.0 | False | 42.0 | 1539.0 | 3.0 | 0.0 | 0.0 | F | S | 1 | 2 |
| 7 | Earth | True | TRAPPIST-1e | 28.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | S | 2 | 2 |
| 8 | Earth | False | TRAPPIST-1e | 35.0 | False | 0.0 | 785.0 | 17.0 | 216.0 | 0.0 | F | S | 1 | 1 |
| 9 | Europa | True | 55 Cancri e | 14.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | B | P | 1 | 3 |
Before imputation
| | HomePlanet | CryoSleep | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Transported | Cabin_deck | Cabin_side | ID_group | ID_num | Group_size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8683 | Earth | False | TRAPPIST-1e | 21.0 | False | 86.0 | 3.0 | 149.0 | 208.0 | 329.0 | 0 | F | P | 9272 | 2 | 2 |
| 8684 | NaN | True | TRAPPIST-1e | 23.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1 | G | P | 9274 | 1 | 1 |
| 8685 | Europa | False | TRAPPIST-1e | 0.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1 | A | P | 9275 | 1 | 3 |
| 8686 | Europa | False | TRAPPIST-1e | 32.0 | False | 1.0 | 1146.0 | 0.0 | 50.0 | 34.0 | 0 | A | P | 9275 | 2 | 3 |
| 8687 | Europa | NaN | TRAPPIST-1e | 30.0 | False | 0.0 | 3208.0 | 0.0 | 2.0 | 330.0 | 1 | A | P | 9275 | 3 | 3 |
| 8688 | Europa | False | 55 Cancri e | 41.0 | True | 0.0 | 6819.0 | 0.0 | 1643.0 | 74.0 | 0 | A | P | 9276 | 1 | 1 |
| 8689 | Earth | True | PSO J318.5-22 | 18.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | G | S | 9278 | 1 | 1 |
| 8690 | Earth | False | TRAPPIST-1e | 26.0 | False | 0.0 | 0.0 | 1872.0 | 1.0 | 0.0 | 1 | G | S | 9279 | 1 | 1 |
| 8691 | Europa | False | 55 Cancri e | 32.0 | False | 0.0 | 1049.0 | 0.0 | 353.0 | 3235.0 | 0 | E | S | 9280 | 1 | 2 |
| 8692 | Europa | False | TRAPPIST-1e | 44.0 | False | 126.0 | 4688.0 | 0.0 | 0.0 | 12.0 | 1 | E | S | 9280 | 2 | 2 |
After imputation
| | HomePlanet | CryoSleep | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Cabin_deck | Cabin_side | ID_num | Group_size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8683 | Earth | False | TRAPPIST-1e | 21.0 | False | 86.0 | 3.0 | 149.0 | 208.0 | 329.0 | F | P | 2 | 2 |
| 8684 | Earth | True | TRAPPIST-1e | 23.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | P | 1 | 1 |
| 8685 | Europa | False | TRAPPIST-1e | 0.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | A | P | 1 | 3 |
| 8686 | Europa | False | TRAPPIST-1e | 32.0 | False | 1.0 | 1146.0 | 0.0 | 50.0 | 34.0 | A | P | 2 | 3 |
| 8687 | Europa | False | TRAPPIST-1e | 30.0 | False | 0.0 | 3208.0 | 0.0 | 2.0 | 330.0 | A | P | 3 | 3 |
| 8688 | Europa | False | 55 Cancri e | 41.0 | True | 0.0 | 6819.0 | 0.0 | 1643.0 | 74.0 | A | P | 1 | 1 |
| 8689 | Earth | True | PSO J318.5-22 | 18.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | S | 1 | 1 |
| 8690 | Earth | False | TRAPPIST-1e | 26.0 | False | 0.0 | 0.0 | 1872.0 | 1.0 | 0.0 | G | S | 1 | 1 |
| 8691 | Europa | False | 55 Cancri e | 32.0 | False | 0.0 | 1049.0 | 0.0 | 353.0 | 3235.0 | E | S | 1 | 2 |
| 8692 | Europa | False | TRAPPIST-1e | 44.0 | False | 126.0 | 4688.0 | 0.0 | 0.0 | 12.0 | E | S | 2 | 2 |
Before imputation
Dataset does not contain duplicate rows.
After imputation
| | HomePlanet | CryoSleep | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Cabin_deck | Cabin_side | ID_num | Group_size | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 169 | Earth | True | TRAPPIST-1e | 14.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | S | 1 | 1 | 13 |
| 181 | Earth | True | TRAPPIST-1e | 18.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | S | 1 | 1 | 13 |
| 193 | Earth | True | TRAPPIST-1e | 22.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | P | 1 | 1 | 13 |
| 85 | Earth | True | PSO J318.5-22 | 16.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | S | 1 | 1 | 12 |
| 170 | Earth | True | TRAPPIST-1e | 15.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | P | 1 | 1 | 12 |
| 180 | Earth | True | TRAPPIST-1e | 18.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | P | 1 | 1 | 12 |
| 184 | Earth | True | TRAPPIST-1e | 19.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | P | 1 | 1 | 12 |
| 195 | Earth | True | TRAPPIST-1e | 22.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | S | 1 | 1 | 12 |
| 98 | Earth | True | PSO J318.5-22 | 22.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | P | 1 | 1 | 11 |
| 172 | Earth | True | TRAPPIST-1e | 15.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | S | 1 | 1 | 11 |
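The dataset statistics list 0 duplicate rows before imputation and 504 after. One plausible explanation, offered here as an assumption rather than something the report states, is that the after-imputation dataset no longer carries ID_group, so rows that differed only by that identifier become indistinguishable. The sketch below illustrates the effect on hypothetical toy rows; `count_duplicate_rows` is an illustrative helper that counts every repeat of an earlier row, which may differ from the counting convention used in the report.

```python
from collections import Counter

# Hypothetical toy rows: (ID_group, HomePlanet, CryoSleep, Age).
rows = [
    (101, "Earth", True, 18.0),
    (102, "Earth", True, 18.0),
    (103, "Earth", True, 18.0),
    (104, "Europa", False, 32.0),
]


def count_duplicate_rows(records):
    """Count rows that repeat an earlier row (each group contributes size - 1)."""
    occurrences = Counter(records)
    return sum(size - 1 for size in occurrences.values())


# With the identifier column, every row is distinct.
with_id = count_duplicate_rows(rows)

# Dropping the identifier (first field) makes the three Earth rows identical.
without_id = count_duplicate_rows([row[1:] for row in rows])

print(with_id, without_id)  # 0 2
```

This matches the pattern in the table above, where the most duplicated rows are passengers in cryosleep with zero spending who share the same planet, destination, age, and cabin fields.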